A Complete Tamil Optical Character Recognition System

Authors

  • K. G. Aparna
  • A. G. Ramakrishnan
Abstract

The aim of the present work is to recognise printed Tamil text. Although commercial Optical Character Recognition (OCR) packages are available for Roman script, little work has been done on OCR for Indian languages. Indian scripts usually have a large number of symbols, which makes recognition a challenging task. In this work, a complete OCR system for printed Tamil text has been developed. An attempt has been made to make it independent of font and size. The methods involved can be extended to other Indian scripts as well.

Related resources

A Complete OCR System Development of Tamil Magazine Documents

We present an early version of a complete Optical Character Recognition (OCR) system for Tamil magazine documents. All the standard elements of OCR process like deskewing, preprocessing, segmentation, character recognition and reconstruction are implemented. Experience with OCR problems teaches that for most subtasks involved in OCR, there is no single technique that gives perfect results for e...

Feed Forward Back Propagation Neural Network based Character Recognition System for Tamil Palm Leaf Manuscripts

Optical character recognition refers to the process of translating segmented hand-written images or typewritten images into machine editable text. In this study, we propose a Tamil palm leaf manuscripts character recognition system using FFBNN technology. First the palm leaf manuscripts characters are segmented by exploiting the sliding window and adaptive histogram calculation. Afterwards, the...

A complete OCR for printed Tamil text

A Neural Network approach is proposed to build an automatic off-line handwritten Tamil character recognition system. We have used a Back Propagation Network (BPN) as a character recognizer. Once trained, the network has a very fast response time. However, the learning phase of this recognizer is a relatively difficult task in this application. The input image of the handwritten character is giv...

Embedded Optical Character Recognition On Tamil Text Image Using Raspberry Pi

Optical character recognition is used to digitize and reproduce texts that were produced with non-computerized systems. Digitizing texts also helps reduce storage space. Editing and reprinting text documents that were printed on paper is time-consuming and labour-intensive. Optical character recognition is also useful for visually impaired people who cannot read text documents, but need t...

Tamil Character Recognition Using Structural Features

In this paper we propose an approach for offline recognition of Tamil characters using their structural features. Structural features are the features that are physically a part of the structure of the character, such as straight lines, arcs, circles, intersections etc. The features used for recognition are the positions of vertical lines, horizontal lines and branching in a character. Some oth...

Publication date: 2002